Amendments to the Claims: 

This listing of claims will replace aU prior versions, and listings, of claims in the application: 

Listing of Claims; 

1 . (Currently amended) A method, in a data processing system, for parsing Eastern Asian language 
character streams, the method comprising: 

receiving a corpus of word-based parse trees; 

converting the corpus of word-based parse trees into a corpus of character-based parse trees; and 
training a character-based p arser using the corpus of character-based parse trees , wherein the 

character-based parser is used at a character level, and wherein the character-based parser does not require 

a separate word-segmenter . 

2. (Original) The method of claim 1, wherein each word-based parse tree in the corpus of word- 
based parse trees includes a word tag for each word in the word-based parse tree. 

3. (Original) The method of claim 2, wherein converting the corpus of word based parse trees 
includes assigning a word position tag to each character in the character-based parse tree based on the 
word tag for each word in the word-based parse tree. 

4. (Original) The method of claim 3, wherein the word position tag is one of a begiiming tag, a 
middle tag, and an end tag. 

5. (Original) The method of claim 1, wherein training the parser includes forming a model. 

6. (Original) The method of claim 5, further comprising: 

providing the model to a decoder, wherein the decoder parses Eastern Asian language character 
streams at a character level using the model. 

7. (Original) The method of claim 6, further comprising: 

receiving a test sentence, wherein the test sentence is an Eastern Asian language character stream; 

and 

parsing the test sentence using the decoder to form one or more character-based parse trees. 
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8. (Original) The method of claim 1, wherein training the parser includes training the parser using 
maximum-entropy method. 



9. (Original) The method of claim 1, wherein the Eastern Asian language is one of Chinese, 
Japanese, and Korean. 

10. (Original) The method of claim 1, wherein the corpus of word-based parse trees is a Chinese 
Treebank. 

1 1 . (Currently amended) An apparatus for parsing Eastern Asian language character streams, the 
apparatus comprising: 

means for receiving a corpus of word-based parse trees; 

means for converting the corpus of word-based parse trees into a corpus of character-based parse 
trees; and 

means for training a character-based p arser using the corpus of character-based parse treeSj^ 
wherein the character-based parser is used at a character level, and wherein the character-based parser 
does not require a separate word-segmenter. 

12. (Currently amended) A recordable-type medium encoding a c omputer program product74H-a 
computor roadablo medium, f or parsing Eastern Asian language character streams, the computer program 
product comprising: 

instructions for receiving a corpus of word-based parse trees; 
instructions for converting the corpus of word-based parse trees into a corpus of character-based parse 
trees; and 

instructions for training a character-based p arser using the corpus of character-based parse treesj^ 
wherein the character-based parser is used at a character level, and wherein the character-based parser 
does not require a separate word-segmenter . 

13. (Currently amended) The recordable-type medium encoding the c omputer program product of 
claim 12, wherein each word-based parse tree in the corpus of word-based parse trees includes a word tag 
for each word in the word-based parse tree. 
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1 4. (Currently amended) The recordable-tvpe medium encoding the c omputer program product of 
claim 13, wherein the instructions for converting the corpus of word based parse trees includes 
instructions for assigning a word position tag to each character in the character-based parse tree based on 
the word tag for each word in the word-based parse tree. 

15. (Currently amended) The recordable-type medium encoding the c omputer program product of 
claim 14, wherein the word position tag is one of a beginning tag, a middle tag, and an end tag. 

16. (Currently amended) The recordable-type medium encoding the computer program product of 
claim 12, wherein the instructions for training the parser includes instructions for forming a model. 

1 7 . (Currently amended) The recordable-type medium encoding the c omputer program product of 
claim 16, further comprising: 

instructions for providing the model to a decoder, wherein the decoder parses Eastern Asian 
language character streams at a character level using the model. 

1 8. (Currently amended) The recordable-type medium encoding the c omputer program product of 
claim 17, further comprising: 

instructions for receiving an input sentence, wherein the input sentence is an Eastern Asian 
language character stream; and 

instructions for parsing the input sentence using the decoder to form one or more character-based 
parse trees. 

19. (Currently amended) The recordable-type medium encoding the computer program product of 
claim 12, wherein the instructions for training the parser includes instructions for training the parser using 
maximum-entropy method. 

20. (Currently amended) The recordable-type medium encoding the computer program product of 
claim 12, wherein the Eastern Asian language is one of Chinese, Japanese, and Korean. 
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